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ABSTRACT 



We calculate the cosmic Mach number M - the ratio of the bulk flow of the 
velocity field on scale R to the velocity dispersion within regions of scale R. M is 
effectively a measure of the ratio of large-scale to small-scale power and can be a useful 
tool to constrain the cosmological parameter space. Using a compilation of existing 
peculiar velocity surveys, we calculate M and compare it to that estimated from mock 
catalogues extracted from the LasDamas (a ACDM cosmology) numerical simulations. 
We find agreement with expectations for the LasDamas cosmology at ~ 1.5<r CL. 
We also show that our Mach estimates for the mocks are not biased by selection 
function effects. To achieve this, we extract dense and nearly-isotropic distributions 
using Gaussian selection functions with the same width as the characteristic depth 
of the real surveys, and show that the Mach numbers estimated from the mocks are 
very similar to the values based on Gaussian profiles of the corresponding widths. We 
discuss the importance of the survey window functions in estimating their effective 
depths. We investigate the nonlinear matter power spectrum interpolator PkANN as 
an alternative to numerical simulations, in the study of Mach number. 
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statistics - cosmology: observations - cosmology: theory - 
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1 INTRODUCTION 



|Ostriker fc Suto| ( |1990[ > introduced a dimensionless statistic 
of the cosmological structure - the cosmic Mach number, as 
a way to measure the warmth/coldness of the velocity field 
on some scale R. Specifically, the Mach number is defined 
as a ratio 

M(x ;iW^°^Y /2 (1) 

where u(xo; R) is the bulk flow (BF) of a region of size R cen- 
tered at xo, and cr(xo; R) is the velocity dispersion of the ob- 
jects within this region. The ensemble average over xo gives 
the statistic M(R). Since both |u(xo;-R)| 2 and <r 2 (xo;-R) 
scale equally by the amplitude of the matter density per- 
turbation, the statistic M is independent (at least in linear 
approximation) of the normalization of the matter power 
spectrum. 

In linear theory, given the cosmological parameters, M 



can be readily calculated and compared with its measured 
value from the peculiar velocity field catalogues. However, 
comparing theoretical predictions with observations is not 
straightforward: (i) one has to correct for the small-scale 
nonlinearities in observations as well as take into account the 
fact that observations represent only a discreet sample of the 
continuous velocity field. This can be remedied by smooth- 
ing the velocity field on a suitable scale r s (~ 5fr — 1 Mpc, 
since on larger scales the matter density field is expected 
to be linear), before estimating the quantities u(xo;i?) and 
<t(xo; R). However, any residual nonlinearity in the observed 
field can still bias the M estimates; (ii) Non-uniform, noisy 
and sparse sampling of the peculiar velocity field can lead 
to aliasing of small-scale power onto larger scales. When 
making comparisons with theory, one has to carefully take 
into account the selection function and the noise of the real 
dataset. (iii) Peculiar velocity surveys have only line-of-sight 
velocity information. 

Over two decades ago, the statistic M has been inves- 
tigated in a series of papers: Ostrikcr & Suto (1990) used 
linear theory and Gaussian selection function to show that 
standard Cold Dark Matter (sCDM) model is inconsistent 
(predicts M almost twice the observed value) with obser- 
vations at ~ 95% CL; Suto, Cen, & Ostriker (19921, using 
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Tophat and Gaussian selection functions, studied the distri- 
bution of M using N-body simulations to rule out the sCDM 



scenario at 99% CL; Strauss, Cen, & Ostriker ( 1993 1 took 



into account the selection function of real surveys and ex- 
tracted mocks from numerical simulations over a range of 
cosmologies including sCDM and tilted CDM (scalar spec- 
tral index, n s ^ 1) among others, to reject the sCDM model 
at 94% CL. More recently |Ma, Ostriker, & Zhaol J20121 ex- 



plored the potential of using M in distinguishing cosmologi- 
cal models, including modified gravity and massive neutrino 
cosmologies. 

In this paper, (i) we estimate the cosmic Mach number 
for various galaxy peculiar velocity datasets; (ii) we investi- 
gate how likely it is to get these Mach values in a ACDM uni- 
verse. To achieve this, we study the statistical distribution of 
the expected Mach number by extracting mocks of the real 
catalogues from numerical simulations of a ACDM universe. 
We show that a ACDM universe with 7-yr WMAP type cos- 
mology is consistent with the Mach observations at ~ 1.5a 
CL; (iii) we further show that our M estimates for the mocks 
are not biased by their selection functions. Towards this, we 
extract dense and nearly isotropic distributions with a Gaus- 
sian profile f(r) oc e^ 2 / 2 ^ with R = 10 - 100/i _1 Mpc. We 
show that the Mach numbers estimated from the mocks are 
very similar to the values based on Gaussian profiles (of sim- 
ilar depth R as the mocks); (iv) we use the nonlinear mat- 



ter power spectrum interpolation scheme PkANN ( Agarwal 



et al. 2012a I to check if we can avoid N-body simulations 



completely and predict M(R) by only using PkANN 's pre- 
diction for the nonlinear power spectrum. This is crucial 
because high-resolution hydrodynamic N-body simulations 
are computationally expensive and extremely time consum- 
ing. Exploring parameter space with numerical simulations 
with reasonable computing resources and time might not be 
possible. A full use of a statistic like M can only be realized 
with a prescription for the nonlinear matter power spectrum. 

In Sec. |2] we discuss the cosmic Mach number statis- 
tic. In Sec. |3]we describe the numerical simulations we use 
to extract mock surveys. In Sec. [4] we describe the galaxy 
peculiar velocity surveys (Sec. |4.l| and the procedure we fol- 
low to extract the mock catalogues (Sec. 4.2 1. In Sec. [5] we 
review the maximum likelihood estimate (hereafter MLE) 
weighting scheme that is commonly used to analyze pecu- 
liar velocity surveys. In Sec. |6j we show our results for the 
statistical distribution of the Mach number estimated using 
various mock catalogues. In Sec. [7] we test the performance 
of PkANN - a nonlinear matter power spectrum interpola- 
tor, in predicting M(R) for the LasDamas cosmology. The 
Mach number estimates from real surveys is summarized in 
Sec. [8] We discuss our results and conclude in Sec. [9] 



2 THE COSMIC MACH NUMBER 

Given a peculiar velocity field v(x), one can calculate the 
bulk flow (see Feldman & Watkins 1994 for details), which 
represent the net streaming motion of a region in some di- 
rection relative to the background Hubble expansion. The 
bulk flow u(xo; R) of a region of size R centered at xq can 



be defined as 
u(x ;i?) = / dx v(x)F(jx — x |, R), 



(2) 



where FQx — xo|, R) is the filter used to average the veloc- 
ity field v(x) on a characteristic scale R. Although Tophat 
and Gaussian filters are the preferred choices, F(\x — xo|, R) 
can be designed to mimic the selection function of the real 
datasets. This is useful when dealing with datasets whose 
selection function depends strongly on the position in the 
sky. In Fourier space, Eq. [2] can be written as 



u(x ; R) 



dk v(k)W(k,R)e 



-ik-XQ 



(3) 



where v(k) and W(k,R) are the Fourier transforms of the 
peculiar velocity field v(x) and the filter F(\x — xo|, R), re- 
spectively. 

In linear theory of structure formation, at low redshifts, 
the velocities are related to the matter overdensities via 
k 



v(k) = ifH 8(k) 



fc 2 ' 



(4) 



where Ho is the present-day Hubble parameter in units of 
km s _1 Mpc -1 ; 5(k) is the Fourier transform of the over- 
density field <5(x); the linear growth rate factor / can be 
approximated as / = f^ 55 (Linder 20051. Thus, the ve- 



locity power spectrum P v (k) is proportional to the matter 
power spectrum P(k) at low redshifts, 



P v (k) = (H fY 



k 2 



(5) 



Using Eq. [3] and Eq. [5j the mean-squared bulk value of 
u(xo; R) can be shown to be 



ol{R) = < u 2 (x ;i?) > = 



Hp Q.ln 

2tt 2 



dk P(k)W 2 (kR), (6) 



where the average is taken over all spatial positions xo. 

The squared velocity dispersion within a region of size 
R centered at xo can be similarly defined as 



g (xo;R) 



dx |v(x)| 2 F(|x-x |,7?) - |u(x ;J?)| 2 . (7) 



In Fourier space, the ensamble average of Eq. [7] over xo 
becomes 



J dk P(k)(l-W 2 (kR)) 



a (R) = < a (x ; R) > = ——^ 

Using Eq. [6] and Eq. ||] the cosmic Mach number can now 
be defined as 



M(R) = < M 2 (x ;fl) > 



1/2 



a 2 OR) 



1/2 



(9) 



As discussed in literature ( Ostriker fc Suto|1990 



Suto, 



Cen, fc Ostrikerlp92l [Strauss, Cen, fc Ostriker||1993t , the 
cosmic Mach number is essentially a measure of the shape of 
the matter power spectrum: The rms bulk flow a v (R) gets 
most of its contribution from scales larger than R, while 
the velocity dispersion <j(R) is a measure of the magnitude 
of velocities on scales smaller than R and gets most con- 
tribution from small scales (for more detailed analyses of 



velocity dispersion see also Watkins (19971; Bahcall & Oh 
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( 1996[ ); [Bahcall et al.| ( |1994| >). Furthermore, the statistic M 
is expected to be independent of the matter power spectrum 
normalization - at least on large scales, where the perturba- 
tions are still well described by linear theory and affect both 
o%(R) and a 2 (R) equally. M can be a powerful tool to test 
not only the ACDM scenario, but also a wide range of cos- 
mologies including models with massive neutrinos. Massive 
neutrinos suppress the matter power spectrum in a scale de- 
pendent way, thereby altering the velocity dispersion much 
more prominently than the bulk flow. Mach number M pro- 
vides an easy to interpret technique to distinguish between 
various cosmological models. 



Table 1. The cosmological parameters and the design specifica- 
tions of the LD- Carmen simulations. 



fn previous work ( Watkins, Feldman, fc Hudson} 
Feldman, Watkins, & Hudson 2010; Agarwal, Feldman 



2009; 
ian, fc| 



Watkins||2012b I we have dealt with peculiar velocity sur- 
veys differently. We developed the 'minimum variance' for- 
malism (hereafter MV) to make a clean estimate of the the 
bulk, shear and the octupole moments of the velocity field 
as a function of scale using the available peculiar velocity 
data. Higher moments get contribution from progressively 
smaller scales. In this paper, instead of isolating the higher 
moments (shear, octupole etc.), we simply estimate the ve- 
locity dispersion a(R) that gets contribution from all scales 
smaller than R. In general, the MLE scheme we employ here 
attempts to minimize the error given a particular survey. 
The downside of the MLE formalism is the difficulty to di- 
rectly compare results from different surveys. Since each sur- 
vey samples the volume differently and although the large 
scale signal is similar across surveys, the small scale noise is 
unique. That leads to complicated biases or aliasing that are 
survey dependent (see |Watkins fc Feldman| ( |1995[ ) ; |Hudson| 



et al. 



( 2000 1) . The MV formalism corrects for small scale 
aliasing by using a minimization scheme that treats the vol- 
ume of the surveys rather than the particular way the survey 
samples the volume, thus eliminating aliasing and allowing 
for direct comparison between surveys. 



3 N-BODY SIMULATIONS 

In order to study the statistical distribution of the cosmic 
Mach number we extract mock surveys from the 41 numer- 
ical realizations of a ACDM universe. The N-body simu- 
lation we use in our analysis is Large Suite of Dark Mat- 



ter S imulations (LasDamas, hereafter LD) (McBridc et al 
McBride et al. 2011 in prejj^j). The LD simulation is a 



2009 



suite of 41 independent realizations of dark matter N-body 
simulations named Carmen and have information at red- 
shift z — 0.13. Using the Ntropy framework (Gardner et al. 
|2007[ ), where bound groups of dark matter particles (halos) 
are identified with a parallel friends-of-friends (FOF) code 



(Davis et al. 1985 1. The cosmological parameters and the de- 
sign specifications of the LD- Carmen are listed in Table [l] 
We extract 100 mock catalogues from each of the 41 
LD- Carmen boxes, for a total of 4100 mocks. The mocks 
are randomly centered inside the boxes. They are extracted 
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Cosmological parameters 


LD- Carmen 


l\ 1 t" f" f^T" f 1 l G l i~~\T \ ) 


25 


Cosrriologiccil constant density, ^ A 


n 7^ 

U. 1 o 


j_>enyuii u.t_,iiai l V; 


0.04 


Hubble parameter, h (100 km s _1 Mpc -1 ) 


0.7 


Amplitude of matter density fluctuations, <jg 


0.8 


Primordial scalar spectral index, n s 


1.0 


Simulation design parameters 




Simulation box size on a side (/i -1 Mpc) 


1000 


Number of CDM particles 


1120 3 


Initial redshift, z 


49 


Particle mass, m p (10 10 h~ 1 M ( r ) ) 


4.938 


Gravitational force softening length, (/i -1 kpc) 


53 



to mimic the radial distribution of the real catalogues (de- 
scribed in Sec. 4.1l, as closely as possible. 



4 PECULIAR VELOCITY CATALOGUES 
4.1 Real Catalogues 

We use a compilation of five galaxy peculiar velocity surveys 
to study the Mach statistic. This compilation, which we label 
'DEEP', includes 103 Type-la Supernovae (SNIa) ( |Tonry| 



et al.||2003| >, 70 Spiral Galaxy Culsters (SC) Tully-Fi sher 
(TF) clusters ( jGiovanelli et al.|p98| [Dale et al.|| 1999a} , 56 
Streaming Motions of Abell Clusters (SMAC) fundamental 



plane (FP) clusters ( |Hudson et aL|l 999 2004 |), 50 Early -type 
Far galaxies (E FAR)TP clus ters ( |Colless et al.||2001[ ) and 
15 TF clusters (|Willick||1999h . In all, the DEEP catalogue 



consists of 294 data points. In Fig. [T] top row, we show the 
DEEP catalogue (left-hand panel) and its radial distribution 
(right-hand panel). The bottom row shows a typical mock 
extracted from the LD simulations. The procedure to extract 
mocks is described in Sec. 14.21 



4.2 Mock Catalogues 

Inside the N-body simulation box, we first select a point at 
random. Next, we extract a mock realization of the real cata- 
logue by imposing the constraint that the mock should have 
a similar radial distribution to the real catalogue. We do not 
constrain the mocks to have the same angular distribution 
as the real catalogue for two reasons: (i) the LD simulation 
boxes are not dense enough to give us mocks that are ex- 
act replicas of the real catalogue, (ii) the objects in a real 
survey are typically weighted depending only on their veloc- 
ity errors. Consequently, even though the real catalogue and 
its mocks have similar radial profiles, their angular distribu- 
tion differ considerably, with the mocks having a relatively 
featureless angular distribution. To make the mocks more 
realistic, we impose a 10° latitude zone- of- avoidance cut. 

Using the angular position {r x ,f y ,r z }, the true radial 
distance d s from the mock center and the peculiar velocity 
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vector v, we calculate the true line-of-sight peculiar veloc- 
ity v s and the redshift cz — d s + v s for each mock galaxy 
(in km/s). We then perturb the true radial distance d 3 of 
the mock galaxy with a velocity error drawn from a Gaus- 
sian distribution of width equal to the corresponding real 
galaxy's velocity error, e. Thus, d v — d s + 84, where d p is 
the perturbed radial distance of the mock galaxy (in km/s) 
and 8d is the velocity error drawn from a Gaussian of width 
e. The mock galaxy's measured line-of-sight peculiar veloc- 



ity v p is then assigned to be v p 



d v , where cz is the 



redshift we found above. This procedure ensures that the 
weights we assign to the mock galaxies are similar to the 
weights of the real galaxies. In Fig. [T] we show the angular 
(left panels) and radial (right panels) distribution of galaxies 
in the DEEP catalogue (top) and its mock (bottom). 



5 THE MAXIMUM LIKELIHOOD ESTIMATE 
METHOD 

One of the most common weighting scheme used in the anal- 
ysis of the bulk flow is the maximum likelihood estimate 
(MLE) method, obtained from a maximum likelihood anal- 



ysis introduced by Kaiser (19881. The motion of galaxies is 
modeled as being due to a streaming flow with Gaussian 
distributed measurement uncertainties. Given a peculiar ve- 
locity survey, the MLE estimate of its bulk flow is obtained 
from the likelihood function 



L[ui\{S„,a n ,a- t }] = J~| 



: exp 



uiY 



(10) 

where r n is the unit position vector of the nth galaxy; a n 
is the measurement uncertainty of the nth galaxy; and er* 
is the 1-D velocity dispersion accounting for smaller-scale 
motions. The three components of the bulk flow m can be 
written as weighted sum of the measured radial peculiar 
velocities of a survey 



(ii) 



where S„ is the radial peculiar velocity of the nth galaxy of a 
survey; and Wi,„ is the weight assigned to this velocity in the 
calculation of m. Throughout this paper, subscripts i,j and 
k run over the three spacial components of the bulk flow, 
while subscripts m and n run over the galaxies. Maximizing 
the likelihood given by Eq. |10| gives the three components 
of the bulk flow m with the MLE weights 



j=i 



where 



E 



(12) 



(13) 



The 1-D velocity dispersion <r* is l/-\/3 of the 3-D ve- 
locity dispersion (see Eq. [8| which we aim to ultimately 
measure. Since the weights Wi, n (and m) are themselves a 
function of <r*, we converge on to the MLE estimate for er* 



DEEP-survey 



-200 
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x (Mpc/h) 
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r (Mpc/h) 



Figure 1. Top row: DEEP catalogue (left) and its radial distri- 
bution (right). Bottom row: DEEP mock catalogue (left) and its 
radial distribution (right). 



iteratively. See Strauss, Cen, & Ostriker ( 1993 I for a discus 



sion on how to estimate the best-fit u% and a*. 

The effective depth of a survey can be roughly estimated 
by a weighted sum ^ w n r n / ^ w n of the radial distances r„ 
of the survey objects, where w„ = l/(cr^ + at). This weight- 

: Zhaol d20T2b in 



ing scheme has been used by |Ma, Ostriker, 
their analyses of peculiar velocity datasets. A drawback of 
using weights w„ = l/(cx^ + cr*) in estimating the depth of 
a survey is that while the weights w n take into account the 
measurement errors a n , they do not make any corrections 
for the survey geometry. A better estimate of the effective 
depth can be made by looking at the survey window func- 
tions Wij . Window function gives an idea of the scales that 
contribute to the bulk flow estimates. Ideally, the window 
function should fall quickly to zero for scales smaller than 
that being studied. This ensures that the bulk flow estimates 
are minimally biased from small-scale nonlinearities. 

Armed with the MLE weights Wi : „ from Eq. |12| the 
angle-averaged tensor window function Wfjik) (equivalent 
to W 2 (kR) of Eq. [r^ can be constructed (for details, see 
Feldman, Watkins, fc Hudson|2010 1 as 



x exp (^ik k ■ (r„ 



The diagonal elements Wu are the window functions of 
the bulk flow components Ui. The window function gives an 
idea of the scales that contribute to the bulk flow estimates. 
Ideally, the window function should fall quickly to zero for 
scales smaller than that being studied. This ensures that 
the bulk flow estimates are minimally biased from small- 
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Figure 2. Left Panel: The window functions WS of the bulk 
flow components calculated using MLE weights for the DEEP 
catalogue. The x, y, z components are dot-dashed, short-dashed 
and long-dashed lines, respectively. The solid lines are the ideal 
window functions I Wfj for scales R = 10 — 40/i -1 Mpc (in 
5/i Mpc increments), the window functions being narrower for 
larger scales. Right Panel: The window functions for a subset of 
the 4100 DEEP mocks (solid lines). The characteristic depth of 
the DEEP catalogue and its mocks is ~ R = 35/i Mpc (dashed 
line). 



scale nonlinearities. See the MLE (Sarkar et al. 2007[ ) and 
MV ( Watkins et al.|2 009) window functions of the bulk flow 
components for a range of surveys. 

Having constructed the survey window functions Wfi, 
the effective depth of the survey can be denned to be the 
one for which Wu is a close match to the window function for 
an idealized survey. In order to construct the ideal window 
functions, we first imagine an idealized survey containing 
radial velocities that well sample the velocity field in a re- 
gion. This survey consists of a large number of objects, all 
with zero measurement uncertainty. The radial distribution 
of this idealized survey is taken to have a Gaussian profile 
of the form fir) oc e~ r ^ 2R , where R gives a measure of 
the depth of the survey. This idealized survey has easily in- 
terpretable bulk flow components that are not affected by 
small-scale aliasing and which reflect the motion of a well- 
defined volume. 

The MLE weights of an ideal, isotropic survey consist- 
ing of N' exact radial velocities v n i measured at randomly 
selected positions r' n , are 



I I ■ 

n ,j 

N' ■ 



where 



jv' 



(15) 



(16) 



Similar to Eq. |l4| the window functions ! Wfj for an 
idealized survey of scale R can be constructed as 



47T 



(?' m • k) ■ t)(i7) 



x exp (ik k • (r' m — r'„)J 



In Fig. [2] left-hand pan el, w e show the diagonal win- 
dow functions Wfi (see Eq. 141 of the bulk flow compo- 



nents calculated using MLE weights (see Eq. 12 1 for the 
DEEP catalogue. The x, y, z components are dot-dashed, 
short-dashed and long-dashed lines, respective ly. A lso shown 
are the ideal window functions I W% (see Eq. 17 1 for scales 



R = 10 — 40/i _1 Mpc (in 5/t _1 Mpc increments), the window 
functions being narrower for larger scales. Comparing the 
DEEP and the ideal window functions gives the DEEP cat- 
alogue an effective depth of ~ R = 35/i~ 1 Mpc. We note 
that the weighted sum ^ w n r n / ^ w n gives the DEEP cat- 
alogue a depth of 59 h~ 1 Mpc, an over-estimation by nearly 
70%. Estimating the survey depth correctly is crucial when 
it comes to comparing the survey bulk flow with theoreti- 
cal predictions. One might have a high-quality survey but 
a poorly estimated depth which can introduce substantial 
errors when comparing with theory. Throughout this paper, 
we define the characteristic depth R of a survey as the one 
estimated from its window functions. The right-hand panel 
of Fig.[2]shows the window functions for a subset of the 4100 
DEEP mocks (solid lines) extracted from the LD- Carmen 
simulations. The fact that the mock window functions are 
nearly centered on the ~ R = 35/i _1 Mpc ideal window, 
shows that our procedure for mock extraction works well. 



6 COSMIC MACH NUMBER STATISTICS 

6.1 Mach statistics for DEEP mocks 

Using the MLE weighting scheme (Sec. [5f, we estimated 
the bulk flow moments {u x ,u y ,u z }, the velocity dispersion 
a and the cosmic Mach number M for each of the 4100 
DEEP mock realizations. In Fig. [3] we show the probability 
distribution for the 4100 DEEP mocks: bulk flow u (left- 
hand panel), dispersion a (middle panel) and the cosmic 
Mach number M (right-hand panel). We found the rms bulk 
flow to be a v — 222 ±86 km s _1 with a velocity dispersion of 
a = 511T98 km s" 1 . Together this implies M = 0.43T0T7 
at lcr CL. Since the DEEP mocks have a characteristic depth 
of R = 35/i _1 Mpc, we can say that for the LD cosmology, 
the expected Mach number on scales of R = 35/i _1 Mpc is 
M = 0.43 ±0.17. 



6.2 Mach statistics for Gaussian Realizations 

In order to find the expected Mach number as a function of 
scale R for the LD cosmology, we went to the same cen- 
tral points for each of the 4100 DEEP mocks and com- 
puted the weighted average of the velocities of all the galax- 
ies in the simulation box, the weighting function being 
g — r /2R -yyrg re p ea t ec j tfu S for a range of scales between 
R — 10 — 100ft _1 Mpc in increments of 5/i _1 Mpc. We sum- 
marize the expected values for the bulk, dispersion and Mach 
number for scales R = 10 - 100/i _1 Mpc in Table |] In 
Fig. [4j we show the expected values for the bulk, disper- 
sion and Mach number (dashed line) together with their lcr 
CL intervals. The corresponding values for the 4100 DEEP 
mocks are shown by a solid circle at the characteristic scale 
R = 35h- 1 Mpc. 

The expected bulk (a v — 234 ± 94 km s _1 ), dispersion 
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Bulk, u (km/s) Dispersion, a (km/s) Mach, M 

Figure 3. Histograms showing the normalized probability distribution for the 4100 DEEP mocks: bulk flow u (left-hand panel), dispersion 
a (middle panel) and the cosmic Mach number M (right-hand panel). We also superimpose the best-fitting Maxwellian (for bulk and 
Mach) and Gaussian (for dispersion) distributions with the same widths as the corresponding histograms. The rms values and the la 
CL intervals are mentioned within each panel. These results correspond to the LD cosmology (see Table nf. 
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Figure 4. The rms values of the bulk flow (left-hand panel), dispersion (middle panel) and the cosmic Mach number (right-hand panel) 
are plotted as a function of scale R. In each panel, the dashed line corresponds to measurements from the Gaussian realizations, whereas 
the shaded region is the la CL interval. The solid circle at R = 35/i -1 Mpc is the result for the DEEP mocks. The error bar is the 
statistical variance of the mean calculated from the 4100 DEEP mocks. Linear theory predictions are shown by the solid line. These 
results correspond to the LD cosmology. The nonlinear contributions to the dispersion are clearly seen in both the center and right 
panels. 




(a = 517 ± 56 km s" 1 ) and Mach number (a = 0.44 ± 17) 
for Gaussian window with R = 35/i _1 Mpc are in excel- 
lent agreement with the corresponding values for the DEEP 
mocks. This shows that the DEEP catalogue probes scales 
up to ~ R = 35ft _1 Mpc, and not R = 59/i _1 Mpc as one 
would have inferred from ~}2w n r„/ ~}2w n using the weights 
w n = l/{ai+ai). 

Linear theory predictions for the LD cosmology are 
shown by the solid lines in Fig. [4] The onset of nonlinear 
growth in structure formation at low redshifts boosts the 
velocity dispersion, causing linear theory to over-predict the 
Mach values. 

The probability distributions for u, a and M from the 
Gaussian realizations in the LD simulations are plotted in 



Fig. [5] for a range of Gaussian widths R. For clarity, we only 
show scales R = 15, 35, 55, 75 and 95/i~ 1 Mpc. 

As expected, the rms bulk flow (dispersion) is a de- 
clining (increasing) function of scale R (see Figs. [4] and [SJ. 
This can be readily understood from the ideal window func- 
tions in Fig. [2] Larger scales have narrower window func- 
tions in Fourier space. Only small scale modes (k oc 1/R) 
contribute to the rms bulk flow integral in Eq. [6j resulting 
in smaller bulk flow on larger scales. The dispersion inte- 
gral (see Eq. [8| gets most of its contribution from higher 
fc— values (k > 1/R) and gradually increases with narrower 
windows. Similar histogram trends were found by Suto. Con, 
& Ostriker ( 1992 1 from numerical simulations of a CDM uni- 
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Figure 5. The same as Fig. [3] but for Gaussian window with R = 15/i -1 Mpc (dotted), R = 35/i -1 Mpc (solid), R = 55h _1 Mpc (short- 
dashed), R = 75/i — x Mpc (long-dashed) and R = 95h _1 Mpc (dot-dashed). For clarity, instead of the histograms, only the best-fitting 
Maxellian/Gaussian distributions with the same widths as the corresponding histograms are shown. The rms values and the la CL 
intervals for R = 35/i _1 Mpc are listed within each panel, and are in good agreement with the corresponding values for the DEEP mocks 
(shown in Fig. [3J . Table [2] summarizes the results for Gaussian widths R = 10 — 100 /i _1 Mpc. 



6.3 Mach statistics for other mocks 

In the following we extend our analysis to include various 
different peculiar velocity surveys, specifically to show that 
our results are not dependent on any radial or angular dis- 
tributions, nor any distinct morphological types. We com- 
pared the Gaussian realizations with mocks (4100 each) cre- 
ated to emulate the radial selection function of the SBF 
(Surface Brightness Fluctuations) ( |Tonry et aT||2001| |, EN- 



EAR (Early-type Nea rby Galaxies) (|da Costa et al 



Bernardi et al. 2002 



2000 



Wegner et al.||2003[ ), SFI++, SNIa 
and SC peculiar velocity surveys. Note that the SC and 
SNIa surveys are also part of our DEEP compilation. The 



SFI++ (Spiral Field I-ban d) catalogue ( |Masters et al.|2006 
|Springob et al.| 2007, 2009| is the densest and most complete 
peculiar velocity survey of field spirals to date. We use data 
from |Springob et al.|[20Tj9| The sample consists of 2720 TF 
field galaxies (SFI++f) and 736 groups (SFI++g). 



Table 2. The rms values of the bulk flow (2nd column), velocity 
dispersion (3rd column) and cosmic Mach number (4th column) 
together with their la CL intervals for Gaussian windows with 
width R (1st column). These values are calculated for the LD 
cosmology (for the LD parameters, see Table [TJ. 
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(hr l Mpc) 
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V< a 2 > 
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487 ± 68 
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0.48 
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0.19 
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0.17 
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0.41 
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0.16 
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0.11 
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0.10 
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565 ± 29 


0.24 
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0.09 
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0.09 
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569 ± 26 


0.22 
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0.21 
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0.08 


100 


113 ± 


11 


572 ± 23 


0.20 
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0.07 
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In Fig. [6] left-hand panels, we show the window func- 
tions Wu of the bulk flow components for the SBF, ENEAR, 
SFI++g, SNIa, SFI++f, DEEP and SC catalogues (top to 
bottom row, respectively). The right-hand panels show the 
window functions for a subset of the corresponding mocks. 
Comparing the window functions of the real catalogues with 
those of the ideal ones (solid lines in the left-hand panels), 
we estimate the characteristic depths of the SBF, ENEAR, 
SFI++g, SNIa, SFI++f, DEEP and SC catalogues to be 
R = 10, 19, 20, 23, 30, 35 and 40/i _1 Mpc, respectively. The 
window functions for these depths are shown in the right- 
hand panels (dashed lines, top to bottom row). 

In Table [3] we summarize the results for the various 
surveys (column 1), where the surveys are listed in order of 
increasing characteristic depth R (column 2) (based on the 
window functions in Fig.|6] as described in Sec. [5}. The error- 
weighted depths J2w n r„/ ^2w n , where w n — l/{a„ + o'l) 
are listed in column 3, and are typically ~ 75% larger than 
R. For the mock surveys, the expected values for the bulk, 
dispersion and Mach number and their la CL intervals are 
summarized in columns 4 — 6. Columns 7 — 9 are computed 
using the real surveys. The quoted errors are calculated us- 
ing the measurement uncertainties a n of the nth galaxy of 
a survey. Comparing columns 6 and 9, the Mach estimates 
for all catalogues agree at ~1.5cr CL for the LD cosmology. 

Similar to Fig. [4] we show results for the SBF, ENEAR, 
SFI++g, SNIa, SFI++f and SC mocks in Fig. [7] Except for 
the SBF and SC catalogues, the results for the other cata- 
logues are a close match to their Gaussian counterparts. Our 



SBF mocks are deeper than the real SBF survey because the 
LD simulations are not dense enough to extract mocks with 
depths less than ~ R — 12/t -1 Mpc. This explains why the 
SBF window functions for the mocks (see Fig. |6j first row, 
right-hand panel) are narrower than the one for the SBF's 
depth of R — 10/i _1 Mpc. Narrower window functions de- 
crease (increase) our bulk flow (dispersion) estimates for the 
SBF mocks. For the SC mocks, the bulk flow (dispersion) 
gets excess (suppressed) contribution from smaller scales due 
to the extended tails of the window functions (see Fig. [6] 
row seven). The SC catalogue, with only 70 clusters, does 
not have a good sky coverage. The DEEP compilation, how- 
ever, has a much better sky coverage, and the results (see 
Fig.]?] solid circle) match those from R = 35/t -1 Mpc Gaus- 
sian mocks. We have included the results for the SBF and SC 
catalogues to specifically show that if the selection function 
of the real survey is not properly modeled, the predictions 
(in our case, based on Gaussian selection function) can be 
misleading. 

For reasonably dense and well sampled velocity surveys, 
like DEEP, SFI++f, and SFI++g, a close match between the 
mock and the Gaussian results shows that the Mach analysis 
for such catalogues is not overly sensitive to the selection 
functions of the individual mocks. As such, one can skip the 
step of extracting mock realizations of the observations from 
N-body simulations, and simply use Mach predictions based 
on Gaussian selection function e~ r ^ 2R with R set to the 
characteristic depth of the survey being studied. 
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Table 3. Peculiar velocity statistics for various surveys (1st column). For each survey. 4100 mocks were extracted from the LD cosmology 
(for the LD parameters, see Table [TJ. The characteristic depth R (2nd column) of the mock catalogues is estimated from the effective 
width of their window functions shown in Fig. [fi] For reference, the error-weighted depth w n r„/ w„ where w n = 1/(<t^ + cr 2 ), is 
listed in the 3rd column. The rms values of the bulk flow (4th column), velocity dispersion (5th column) and cosmic Mach number (6th 
column) together with their la CL intervals. Columns 7—9 correspond to the real surveys with the quoted errors calculated using the 
radial distance uncertainties. 
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473 ± 66 
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0.58 ± 0.21 
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240 ± 86 
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0.47 ± 0.15 
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503 ± 22 
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222 ± 86 


511 ± 65 


0.43 ± 0.17 


312 ± 61 
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0.70 ± 0.14 
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485 ± 43 
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520 ± 74 
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Figure 7. Similar to Fig. [4] including results for the SBF (open triangle), ENEAR (solid triangle), SFIH — hg (open square), SNIa (solid 
square), SFI++f (open circle), DEEP (solid circle) and SC (cross) mocks. The DEEP compilation includes the SC, SNIa, SMAC, EFAR 
and Willick surveys. 



7 MOVING BEYOND N-BODY 

SIMULATIONS: MACH PREDICTIONS 
USING PKANN 

In Sec. [6] we showed that for velocity surveys with low con- 
tamination from small scales, reasonably accurate predic- 
tions for the Mach number can be made by extracting mocks 
having a Gaussian radial profile e~ r ^ 2R , R being the char- 
acteristic depth of the survey being studied. 

A further simplification in the Mach analysis one can 
hope to achieve is to be able to predict M(R) as a function 
of scale R without resorting to N-body simulations. Running 
high-resolution N-body simulations, even in the restricted 
parameter space around 7-yr WMAP ( Komatsu et al.|2011 1 
central parameters, is beyond present day computing capa- 
bilities. It would be much easier and faster to explore the 
parameter space using a prescription for the matter power 
spectrum, and using Eq. [6] and Eq. [8] to predict the cosmic 
Mach number. So far, this has been possible by using linear 



theory. However, for linear theory results to be applicable, as 
mentioned in Sec.[l] one needs to correct for the nonlineari- 
ties in the observed velocity field. Any residual nonlinearity 
can still bias the Mach predictions. 

In this section, we attempt to predict M(R) using 
PkANN ( Agarwal et al. 2012a I — a neural network interpola- 



tion scheme to predict the nonlinear matter power spectrum 
up to k < 0.9/iMpc -1 between redshifts z — — 2. Although 
PkANN accuracy worsens (starts under-predicting the non- 
linear spectrum for k j> 0.9/tMpc -1 ), we do not attempt to 
correct this by smoothing the velocity field over the relevant 
spatial scale. In Fig. [SJ we replace linear theory predictions 
shown in Fig. [7] with the ones calculated using PkANN 
for the LD cosmology. PkANN (solid lines) gives an excel- 
lent match with the N-body results (dashed lines) on all 
scales, showing PkANN can substitute numerical simula- 
tions for the purpose of calculating the Mach number given 
a set of cosmological parameters. Although, we have shown 
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Figure 8. Similar to Fig. [7] but instead of showing linear theory predictions, we plot predictions based on the nonlinear matter power 
spectrum for the LD cosmology estimated using PkANN. 



Pk ANN's performance for only the LD cosmology, it is ex- 
pected to perform satisfactorily for cosmologies around 7- 
yr WMAP central parameters for which PkANN has been 



specifically trained. See Agarwal et al. (2012a I for details on 
the parameter space of PkANN's validity. 



MACH NUMBER ESTIMATES FROM REAL 
CATALOGUES 



Ma, Ostriker, & Zhao (20121 measured Mach number for 
four peculiar velocity surveys (SBF, ENEAR, SNIa and 
SFI++f ) and found that the ACDM model with 7-yr 
WMAP parameters is mildly consistent with the Mach num- 
ber estimates for these four surveys at 3a CL. However, 
as the authors mention in their work, their estimates are 
based on using linear approximation for the power spec- 
trum. Given the fact that at low redshifts structure forma- 
tion has gone nonlinear on small scales, it is necessary to 
consider nonlinearities when making theoretical predictions. 
Comparing Figs. [7] and [8] (middle panels), one can see that 
dispersion is significantly boosted by nonlinearities, lowering 
the Mach predictions (third panels) by la level. 

Further, they work with Tophat window functions in 
their analysis. A Tophat filter assumes a volume-limited sur- 
vey with a sharp edge in real space. However, the number 
density of objects sampled in a real survey typically fall at 
large distances. Real surveys thus have a narrower depth 
than what a Tophat would suggest. The sharp edge of a 
Tophat creates both ringing and extended tails in fc-space. 
Since it is the small scale modes that are most contaminated 
by nonlinearities at low redshifts, a Tophat filter leads to 
aliasing of small-scale power onto larger scales. As such, a 
Tophat filter is a poor choice if one wants to isolate the 
contribution from small scales. 

It is worth mentioning here that using a Gaussian win- 



dow function W (kR) = e 



over-damps the high-fc 



tails associated with a Tophat. The reason being that we 
only observe the line-of-sight component of the velocity field, 
whereas the equations presented in Sec. [2] are based on the 



full 3D velocity measurements. The line-of-sight component 
extends the tails of the survey window functions in fc-space 
(see Grinstein et al.||l987 Kaiser 1988). This is the reason 
why in our analysis, we do not use W 2 (kR) = e~ A R ; in- 
stead, we compute the ideal window functions using only the 
line-of-sight information (see Eq. |17[ ). The extended tails of 
the ideal window functions can be seen in Fig. IS] and should 
be contrasted against W (kR) 



Ma, Ostriker, & Zhao (20121 estimated the character- 
istic depth of these surveys using ^ w n r„/ ^ w n where 
w n = l/(<7n + crj). Specifically, they found depths of 
16.7, 30.5, 30.7 and 50.5/i _1 Mpc for the SBF, ENEAR, SNIa 
and SFI++f, respectively. However, from Fig.[6jand Table|3] 
(rows one, two, four and five), we show that these surveys 
probe scales of ~ R = 10, 19, 23 and 30/i _1 Mpc, respec- 
tively. Using linear theory with Tophat filters, and neglect- 
ing the survey window functions while estimating the effec- 
tive depths, makes the bulk flow (and any derived) statistic 
highly complicated to interpret. 

In Sec. |6.3| we used numerical simulations to study the 
Mach statistic for SBF, ENEAR, SFI++g, SNIa, SFI++f, 
DEEP and SC mocks. In this section, we calculated the 
Mach number using the real catalogues themselves. The 
results are shown in Fig. [9] and summarized in Table [3] 
columns 7 — 9. We find the Mach observations lie within 
~ 1.5a interval for a ACDM universe with LD parameters. 
The high uncertainty in the Mach number for the SC cata- 
logue we attribute to its poor sky coverage. 



9 DISCUSSION AND CONCLUSIONS 

The estimates of bulk flow and dispersion on scale R are 
subject to observational errors stemming from the accuracy 
levels of distance indicators used, and the survey geometry. 
Typically, the velocity power spectrum is smoothed using 
Tophat or Gaussian filters, with results depending on the 
exact smoothing procedure used. Often, bulk flow results 
are quoted and inferences drawn about our cosmological 
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Figure 9. Similar to Fig. [8] but instead of showing the Mach numbers for the mocks, we plot the Mach numbers for the real surveys. 
The error bars are calculated using the radial distance uncertainties. 



model, without paying much attention to the survey win- 
dow functions which are essential in determining the scales 
that contribute to quantities derived from peculiar veloci- 
ties. A statistic such as the cosmic Mach number can be a 
useful tool to test theories of structure formation, provided 
the observational uncertainties are accounted for and the 
scale is properly determined. 

In this paper, we studied the statistical distribution 
of Mach number by extracting mock realizations of the 
real peculiar velocity catalogues from LasDamas numerical 
simulations. We showed that the Mach number estimates 
from the real catalogues agree with the expectations for a 
ACDM universe at ~ 1.5c level at the characteristic scales 
of the surveys. We checked if our Mach expectations de- 
rived from mock surveys were biased by the selection func- 
tion effects: we extracted realizations with a Gaussian pro- 
file f(r) oc e" r /2R and found no significant change to our 
Mach values for the mock surveys. 

We compared results from numerical simulations to 
show that theoretical prediction of Mach number based on 
linear theory of structure formation is inaccurate. Specifi- 
cally, small-scale nonlineraties increase velocity dispersion, 
thereby lowering the Mach predictions by about la for 
a ACDM universe with WMAP type cosmology. We pre- 
sented an alternative method to study the cosmic Mach 
number - by using a prescription for the nonlinear mat- 
ter power spectrum, instead of running time-consuming and 
computationally-intensive numerical simulations. Nonlinear 
power spectrum interpolators like PkANN offer tremendous 
leverage over numerical simulations, by being able to explore 
the parameter space quickly. The role of such interpolating 
schemes in the study of quantities derived from peculiar ve- 
locities needs further investigation. Also, in the future we 
plan to employ MV-like formalism to study this statistic, 
reduce the nonlinear signal to below the statistical errors 
and thus create a truly linear Mach number statistic that 
can be used to directly compare results from disparate sur- 
veys as a function of the volume chosen and probed. 
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